Redundancy > Redundancy Overview > Redundancy Concepts

CygNet Redundancy Concepts

This topic lists and describes the fundamental concepts underlying the CygNet Redundancy application.

See the following subsections below for more information:

Automatic Failover

Automatic failover within the CygNet Redundancy model assures that failed services will switch operation from an active domain to a standby (redundant) domain in the event the active RSM (and its owned services) becomes unavailable. Any RSM running on a domain configured in a local standby or a data-center standby role will constantly monitor the current status of the RSM on the primary active server. If the standby RSM detects that the RSM on the active domain is unavailable for whatever reason, a failover will be automatically triggered.

Bastion Host

A Bastion Host is a live CygNet host located outside the Redundancy environment, but the services on the bastion host are themselves not configured for redundancy. The purpose of the bastion host is to monitor the service and site health of the sites within the redundancy environment. The bastion host can also be configured to send notifications during a failover, and to host a common Audit service across redundant domains. The bastion host should not contain any production services.

A bastion host is an optional, but highly recommended configuration.

Bastion Host Setup Example
Bastion Host Setup Example
Required Services

The bastion host must have an ARS service:

Optional Services

The bastion host may have a SVCMON service:

Recommended Services

Best practice recommends an AUD, BSS, ELS, and GNS service configured on the bastion host. See Services on the Bastion Host for more information.

Firewall

If the bastion host is located on a server outside a firewall, the firewall must be configured to allow UDP traffic through the various service ports.

Control Network

The Control Network is the network that contains the Live Domain, which is the primary domain in your redundancy environment doing the polling. The Control Network is configured as part of the redundancy definition on the Network page of the CygNet Redundancy Editor.

Control Network Example
Control Network Example

Data Center

A Data Center is a group of networked servers typically used for the remote storage, processing, or distribution of large amounts of data. In the diagram above, two data centers are shown: Houston Primary Data Center and Oklahoma Backup Data Center. The Test Network could be in a different data center, or in one of the other two.

Disaster Recovery

Disaster Recovery is the act of recovering data due to a natural or man-made disaster to maintain business continuity. If the currently Active data center has incurred the disaster, a data-center failover may (will likely) be necessary.

Domain

Domain denotes the network port used by the ARS to communicate across the network. See Domains for more information.

Domain Role

The Domain Role specifies the purpose of the domain. Roles include:

The domain role never changes. The Domain Role is configured as part of the redundancy definition on the Domain page of the CygNet Redundancy Editor.

Single Data Center showing the Active Server
Single data center showing the Active server
Single Data Center showing Active Server and Local Standby Server
Single data center showing the Active server
and the Local Standby server
Active and Local Standby servers
Two data centers showing the
Active and Local Standby servers
in the Primary data center and the
Data-Center Standby and the Local Standby servers
in the Backup data center

Failover

Failover is the process of switching the domain on which the active server is running to a standby (or backup/redundant) server. Failover ensures seamless operation in the event of a planned or unplanned interruption in service. The purpose of a redundant environment is to support failover.

Failover can be manual or automatic. Manual failover requires user intervention to initiate and determine the failover process. Automatic failover occurs when a system is configured to automatically switch services from an active server to a standby server in the event of a total system compromise. See Automatic Failover.

High Availability Distributed (HAD) Failover

CygNet services are switched over from host machines in one data center to backup CygNet services running at a separate, geographically distant data center. Often described as Disaster Recover (DR) failover. See Remote Recovery.

High Availability Local (HAL) Failover

CygNet services are switched from host machines in a local data center to backup CygNet services running on another host machine in the same local data center. See Local Recovery.

Live Domain

The Live Domain is the single domain in your redundant environment responsible for collecting data. All data originates in this domain. The live domain lives within the Control Network and could run on any single host in a control network.

Live Domain Example
Live Domain Example

Local Recovery

An RSM in a redundancy environment is expected to restart and fix any failed local service. This is accomplished by configuring a Failover action in Automatic Service Recovery (either in the RSM for each service, or via a mass modification update in the CygNet Redundancy Editor.)

Redundancy Environment

The Redundancy Environment contains all CygNet servers, sites, and domains in a CygNet system that engage in a redundant relationship. The redundancy environment is configured in the CygNet Redundancy Editor.

Redundancy Editor

The CygNet Redundancy Editor is a configuration tool where you can define the relationships between the servers, hosts, and sites in your redundancy environment. The Redundancy Editor is also where you set up the trigger conditions that will initiate the auto-failover process for each domain in the redundancy environment. Once all redundant relationships are known by the system, you can execute failover between machines, stop and start sites, and allow servers to switch the domains on which they are running, and support high availability across data centers.

The editor can be invoked from the RSM in CygNet Explorer via a context menu, or added to a CygNet Studio screen. See CygNet Redundancy Editor.

Redundancy

Redundancy is a system architecture that provides for the duplication of critical services to meet the requirements of high availability. Replication and failover are activities that occur within a redundant system architecture ensuring data integrity is matched between a source and a destination.

Redundant Mode

Redundancy Mode describes the current redundancy state for a service. See REDUNDANT for more information. Options include:

The IN_REDUNDANT_SET info item returns the current redundancy state of the service.

Redundant Relationship

Two or more services connected in a duplicative way are considered to be in a Redundant Relationship. Relationships are configured in the CygNet Redundancy Editor and stored in redundancy definition tables for each participating RSM and synchronized to every RSM that is part of the relationship. Once an RSM has a redundancy definition, it can only be modified via the editor if it's running on the primary domain (as specified in the relationship definition).

The REDUNDANT keyword indicates whether redundancy is enabled for a service and if it is part of a redundant set with other services. The IN_REDUNDANT_SET info item returns whether the service is part of a redundant relationship.

If a service is part of a redundant relationship, it would be invalid to have replication configured at the same time as this value will be passed in from the RSM when needed. A service should not startup with this configuration.

The service will not start up if both redundancy and replication are enabled simultaneously.

Redundant Server Pairs

Redundant server pairs are two partner servers at one location (i.e. in a control room) configured to provide redundant SCADA server capability. See Domain Role above.

Active/Standby Primary/Backup Pairs Example
Active/Standby Primary/Backup Pairs Example

Remote Recovery

In a redundancy environment configured for automatic failover, any standby RSMs (local or data center) will monitor active RSMs for failure. If one or more active RSMs become unavailable (based on auto-failover triggers configured in the CygNet Redundancy Editor), the standby RSM will initiate a failover. A test mode is available to help verify the configured trigger settings. Remote recovery works with both local and data-center redundancy. When enabled, the standby RSM will have a Failover Status of Monitoring.

Remote Service Manager (RSM) Role

The Remote Service Manager (RSM) is the CygNet service that performs all the work related to CygNet Redundancy. Note the following:

Zone

The Zone represents the physical server(s) hosting a single instance of a CygNet site. In a CygNet redundancy environment a zone represents one or more servers running one or more redundant RSMs all operating on the same domain. A zone can be a single server, but it may also represent multiple servers running one or more sites that can be part of a redundant definition. Zones are configured as part of the redundancy definition on the Zone page of the CygNet Redundancy Editor.

Zone Example
Zone Example
Back to top

Let us know how we can improve this topic.

CygNet at weatherford.com

© 2020 Weatherford. All rights reserved.